nova05: follow up changes#764

Open
bogdando wants to merge 5 commits into
openstack-k8s-operators:mainfrom
bogdando:nova05epsilon
@bogdando bogdando commented Jun 3, 2026

Contributor
  • Add cross-site routes/GWs

    For all of nova05epsilon's NADs, NetConfig subnets, and host-level NNCP routes, add the missing
    cross-site DCN routes and gateways.

    Previously these were tested in ci-framework job overlays, but the base DT example was missing the
    corresponding CHANGEME templates.

  • Add SNO Swift fix hook for Ceph RGW

  • Ext net for tests accessing VMs via FIP

    Add external NAD definition with VLAN to DT nncp/values.yaml.
    Update scenario-vars.yaml with external NAD for Tempest access to FIP.

  • Add automation mocks for CI

  • Fix telemetry spec and kustomize chain for SNO

    The DT-level kustomization was missing service-values replacements
    (telemetry, nova, glance, ovn, extraMounts), so the examples kustomize
    build produced base defaults instead of DT-specific config. This caused
    metricStorage.enabled=false and missing alertmanagerConfig on the
    deployed OSCP.

    Additionally, pvcStorageClass was hardcoded to "local-storage" but SNO
    uses LVMS ("lvms-local-storage"), and alertmanager defaulted to 2
    replicas which cannot schedule on a single node due to required
    pod anti-affinity.

  • Align the kustomization layers with nova04delta:

    • dt/nova/nova05epsilon/kustomization.yaml: add service-values
      replacements matching nova04delta pattern
    • Remove duplicated replacements from
      dt/nova/nova05epsilon/control-plane/kustomization.yaml
    • examples/.../control-plane/service-values.yaml: add full DT-specific
      data (telemetry, nova, glance, ovn, extraMounts)
    • Set pvcStorageClass to lvms-local-storage and alertmanagerConfig
      replicas to 1 for SNO compatibility
    • update control-plane.md
  • Separate pre and post ceph OSCP configs

    HCI uses edpm post ceph to also patch the OSCP, which looks suboptimal.

    Separate this into two steps:

    • make post-ceph nodeset layer handling EDPM resources only (NodeSet,
      Services, Secrets, ConfigMap) - remove OSCP layer resources
    • add control-plane-post-ceph layer for OSCP with Glance RBD + Ceph
      extraMounts and secret config. Instead of re-including lib/control-plane
      (which required network-values duplication)
      provide only required values for lib/control-plane/base.
    • update README.md stages list from 4 to 6 steps, automation step
      count from 9 to 10
    • add note for deferred Ceph config for control-plane.md
    • add control-plane post-ceph doc for the dedicated Ceph
      control-plane update stage
    • remove control-plane update references for
      dataplane-post-ceph.md, add prerequisite for
      control-plane-post-ceph
    • add control-plane-post-ceph stage between pre-ceph deployment and
      post-ceph nodeset in automation vars
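
The pre/post-Ceph split described above can be sketched as two small checks. This is a rough illustration in Python, not part of the change itself: the stage labels and resource names are hypothetical, and only the relative stage order and the "no OSCP resources in the post-ceph nodeset layer" rule are taken from the description.

```python
# Stage labels are hypothetical; only their relative order comes from the description.
STAGES = ["pre-ceph-deployment", "control-plane-post-ceph", "post-ceph-nodeset"]


def runs_before(stages, first, second):
    """Return True if stage `first` is scheduled earlier than stage `second`."""
    return stages.index(first) < stages.index(second)


def oscp_leaks(nodeset_layer):
    """Return names of OpenStackControlPlane resources found in a nodeset layer.

    After the split, the post-ceph nodeset layer should carry EDPM resources
    only, so this list is expected to be empty.
    """
    return [
        res["metadata"]["name"]
        for res in nodeset_layer
        if res["kind"] == "OpenStackControlPlane"
    ]


# Hypothetical rendered resources for the post-ceph nodeset layer.
nodeset_layer = [
    {"kind": "OpenStackDataPlaneNodeSet", "metadata": {"name": "example-nodeset"}},
    {"kind": "Secret", "metadata": {"name": "example-ceph-conf"}},
]
```

A check like `oscp_leaks(nodeset_layer)` could run against the rendered output of the nodeset layer to confirm that the OSCP patch now lives only in control-plane-post-ceph.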

Depends-on: openstack-k8s-operators/ci-framework#3773
Related: OSPRH-30706

@openshift-ci openshift-ci Bot requested review from karelyatin and raukadah June 3, 2026 14:51
@bogdando bogdando changed the title nova05: add cross-site routes/GWs and annotations nova05: add cross-site routes/GWs and annotations and SNO Swift hook fix for Ceph RGW Jun 3, 2026
@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/748123c85aff449bbb63c8ec67a2a576

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 10s

@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/562686cbdf7d4c7eaa0d291a61b28ee8

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 3m 54s

@bogdando bogdando changed the title nova05: add cross-site routes/GWs and annotations and SNO Swift hook fix for Ceph RGW nova05: follow up changes for tempest and ansibleTest/gpu-validation Jun 5, 2026
@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/3bc80cde7ec544b9ae1c3c5e6ecee80a

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 04s

@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/30d9674ca6024d4c9a9f6e49b146629a

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 04s

@bogdando bogdando force-pushed the nova05epsilon branch 2 times, most recently from ca9f9f8 to 849f177 Compare June 5, 2026 15:06
@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/44007c1c904d448dad27bbdda99d79ef

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 48s

@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/45ec0f2752f549cca4bd5da30b14c175

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 40s

@bogdando bogdando force-pushed the nova05epsilon branch 2 times, most recently from 2a291a0 to 7285cb6 Compare June 6, 2026 05:06
@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/4de996b4280645dc8717c425be0814ab

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 16s

@fultonj

fultonj commented Jun 8, 2026

Contributor

Perhaps add depends-on: https://github.com/openstack-k8s-operators/ci-framework/pull/3773?

We cannot merge this until the above provides ../../hooks/playbooks/fix_swift_endpoint.yml

@fultonj fultonj added the do-not-merge Changes are not ready to be merged label Jun 8, 2026
@fultonj fultonj requested review from abays and fmount and removed request for karelyatin and raukadah June 8, 2026 11:09
@bogdando bogdando force-pushed the nova05epsilon branch 4 times, most recently from 0b386fc to 56d2519 Compare June 10, 2026 17:13
@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/a7e81092760f4f57834ed14885bb557d

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 5m 23s

@bogdando bogdando force-pushed the nova05epsilon branch 4 times, most recently from 8d18c89 to 1fb62b1 Compare June 11, 2026 12:49
bogdando added 2 commits June 11, 2026 17:59
Signed-off-by: Bohdan Dobrelia <bdobreli@redhat.com>
The DT-level kustomization was missing service-values replacements
(telemetry, nova, glance, ovn, extraMounts), so the examples kustomize
build produced base defaults instead of DT-specific config. This caused
metricStorage.enabled=false and missing alertmanagerConfig on the
deployed OSCP.

Additionally, pvcStorageClass was hardcoded to "local-storage" but SNO
uses LVMS ("lvms-local-storage"), and alertmanager defaulted to 2
replicas which cannot schedule on a single node due to required
pod anti-affinity.

Align the kustomization layers with nova04delta:
- dt/nova/nova05epsilon/kustomization.yaml: add service-values
  replacements matching nova04delta pattern
- Remove duplicated replacements from
  dt/nova/nova05epsilon/control-plane/kustomization.yaml
- examples/.../control-plane/service-values.yaml: add full DT-specific
  data (telemetry, nova, glance, ovn, extraMounts)
- Set pvcStorageClass to lvms-local-storage and alertmanagerConfig
  replicas to 1 for SNO compatibility
- update control-plane.md
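
A minimal sketch, in Python rather than kustomize YAML, of why a missing replacement leaves the base defaults in place and why two alertmanager replicas cannot fit on SNO. The dictionary layout, the `apply_replacements` and `max_schedulable` helpers, and the key nesting are illustrative assumptions; they are not the real OSCP schema or the kustomize implementation.

```python
import copy


def set_path(doc, path, value):
    """Set a dotted path in a nested dict, creating intermediate dicts as needed."""
    keys = path.split(".")
    node = doc
    for key in keys[:-1]:
        node = node.setdefault(key, {})
    node[keys[-1]] = value


def apply_replacements(base, values, replacements):
    """Copy values[source_key] onto target_path for each (source_key, target_path) pair.

    With no replacement listed for a key, the rendered document keeps the base default.
    """
    rendered = copy.deepcopy(base)
    for source_key, target_path in replacements:
        if source_key in values:
            set_path(rendered, target_path, copy.deepcopy(values[source_key]))
    return rendered


def max_schedulable(replicas, nodes):
    """With required pod anti-affinity (one replica per node), at most `nodes` replicas fit."""
    return min(replicas, nodes)


# Hypothetical, simplified stand-ins for the base OSCP and the DT service-values data.
base = {"spec": {"telemetry": {"metricStorage": {"enabled": False}}}}
service_values = {
    "telemetry": {
        "metricStorage": {"enabled": True, "pvcStorageClass": "lvms-local-storage"},
        "alertmanagerConfig": {"replicas": 1},
    }
}

# Without the DT-level replacement the base default survives the build.
without = apply_replacements(base, service_values, [])
# With it, the DT-specific telemetry data overrides the default.
with_repl = apply_replacements(base, service_values, [("telemetry", "spec.telemetry")])
```

Here `max_schedulable(2, 1)` evaluates to 1, which mirrors the single-node case: the second replica has no node to land on.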

Generated-by: claude-4.6-opus-high
Signed-off-by: Bohdan Dobrelia <bdobreli@redhat.com>
@centosinfra-prod-github-app

Contributor

This change depends on a change that failed to merge.

Change openstack-k8s-operators/ci-framework#3773 is needed.

HCI uses edpm post ceph to also patch the OSCP, which looks suboptimal.

Separate this into two steps:
  * make post-ceph nodeset layer handling EDPM resources only (NodeSet,
    Services, Secrets, ConfigMap) - remove OSCP layer resources
  * add control-plane-post-ceph layer for OSCP with Glance RBD + Ceph
    extraMounts and secret config. Instead of re-including
    lib/control-plane (which required network-values duplication)
    provide only required values for lib/control-plane/base.
  * update README.md stages list from 4 to 6 steps, automation step
    count from 9 to 10
  * add note for deferred Ceph config for control-plane.md
  * add control-plane post-ceph doc for the dedicated Ceph
    control-plane update stage
  * remove control-plane update references for
    dataplane-post-ceph.md, add prerequisite for
    control-plane-post-ceph
  * add control-plane-post-ceph stage between pre-ceph deployment and
    post-ceph nodeset in automation vars

Signed-off-by: Bohdan Dobrelia <bdobreli@redhat.com>
@bogdando

Contributor Author

Finally works with my local testing

@bogdando

Contributor Author

downstream testing completed for this DT and dependency patches, it is good to go

@bogdando bogdando removed the do-not-merge Changes are not ready to be merged label Jun 17, 2026

@evallesp evallesp left a comment

Contributor

/lgtm

@openshift-ci

openshift-ci Bot commented Jun 17, 2026


[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: bogdando, evallesp
Once this PR has been reviewed and has the lgtm label, please assign leifmadsen for approval. For more information see the Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

Comment thread examples/dt/nova/nova05epsilon/control-plane-post-ceph/network-values.yaml Outdated
@openshift-ci openshift-ci Bot removed the lgtm label Jun 17, 2026
@openshift-ci

openshift-ci Bot commented Jun 17, 2026


New changes are detected. LGTM label has been removed.

@centosinfra-prod-github-app

Contributor

Build failed (check pipeline). Post recheck (without leading slash)
to rerun all jobs. Make sure the failure cause has been resolved before
you rerun jobs.

https://gateway-cloud-softwarefactory.apps.ocp.cloud.ci.centos.org/zuul/t/rdoproject.org/buildset/9419fc3e879947b5845fee2b24cbee92

✔️ noop SUCCESS in 0s
rhoso-architecture-validate-nova05epsilon FAILURE in 4m 32s

Signed-off-by: Bohdan Dobrelia <bdobreli@redhat.com>
@bogdando

Contributor Author

@abays the deduplication patch worked for my testing

4 participants